Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric
Abstract
In spectral conversion for statistical voice conversion (VC), distance-based measures between the converted and target spectral parameters are often used as evaluation or training criteria. However, even when the same speaker utters the same sentence, the spectral parameters vary from utterance to utterance, so a spectral distance between utterances still exists. Moreover, some VC systems, such as those that operate in real time, keep the original prosodic features of the input speech unchanged. In such cases, the prosody of the converted speech differs from that of the target speech, and this difference increases the spectral distance. Conventional evaluation and training criteria do not take these potential spectral variations into account. Thus, by constructing criteria that consider this spectral difference, improvements in sound quality can be expected. In this paper, we investigate intra-speaker spectral variation between utterances of the same sentence. We also propose a method for predicting this variation from the prosodic parameter differences between the corresponding utterances. We conduct experimental evaluations using many speech samples of the same sentence uttered by a single speaker, and the results demonstrate that the proposed method effectively predicts the intra-speaker spectral variation from the observed prosodic changes.
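As an illustration of the kind of distance-based measure the abstract refers to, the following is a minimal sketch of mel-cepstral distortion, a spectral distance commonly used in VC evaluation. The abstract does not name the specific measure the authors use, so the choice of mel-cepstral distortion is an assumption here, as are the function name `mel_cepstral_distortion` and the requirement that the two utterances are already time-aligned (for example by dynamic time warping) and passed without the 0th (energy) coefficient.

```python
import math


def mel_cepstral_distortion(frames_a, frames_b):
    """Mean mel-cepstral distortion in dB between two aligned utterances.

    Hypothetical helper for illustration only. Each argument is a list of
    frames, and each frame is a list of mel-cepstral coefficients. The
    sequences are assumed to be time-aligned already and to exclude the
    0th (energy) coefficient.
    """
    if not frames_a or len(frames_a) != len(frames_b):
        raise ValueError("sequences must be non-empty and of equal length")

    # Standard scaling constant: (10 / ln 10) * sqrt(2).
    scale = 10.0 / math.log(10.0) * math.sqrt(2.0)

    total = 0.0
    for frame_a, frame_b in zip(frames_a, frames_b):
        if len(frame_a) != len(frame_b):
            raise ValueError("frames must have the same number of coefficients")
        squared_diff = sum((x - y) ** 2 for x, y in zip(frame_a, frame_b))
        total += scale * math.sqrt(squared_diff)

    # Average the per-frame distortion over the utterance.
    return total / len(frames_a)
```

Under this sketch, two recordings of the same sentence by the same speaker would generally yield a non-zero value, which corresponds to the intra-speaker variation that the abstract argues conventional criteria ignore.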
Similar resources
Using Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from a few spoken words of a sentence has many applications. Many methods, such as DTW, HMM, VQ, and MQ, can be used for speaker verification. We applied MQ for its precise, reliable, and robust performance combined with computational simplicity. We also used pitch frequency and log gain contour to further improve system performance.
Voice Conversion Based on Probabilistic Parameter Transformation and Extended Inter-speaker Residual Prediction
Voice conversion is a process which modifies speech produced by one speaker so that it sounds as if it is uttered by another speaker. In this paper a new voice conversion system is presented. The system requires parallel training data. By using linear prediction analysis, speech is described with line spectral frequencies and the corresponding residua. LSFs are converted together with instantan...
First Steps Towards New Czech Voice Conversion System
In this paper we describe initial experiments on creating a new Czech voice conversion system. Voice conversion (VC) is a process that modifies the speech signal produced by one (source) speaker so that it sounds like another (target) speaker. Using a VC technique, a new voice for a speech synthesizer can be prepared without recording a huge amount of new speech data. The transformation is de...
Spectro-Temporal Modelling with Time-Frequency LSTM and Structured Output Layer for Voice Conversion
In speech, speaker identity is largely characterized by the spectro-temporal structure of the spectrum. Although recent research has demonstrated the effectiveness of long short-term memory (LSTM) recurrent neural networks (RNNs) in voice conversion, traditional LSTM-RNN based approaches usually focus only on the temporal evolution of speech features. In this paper, we improve the con...
Publication date: 2013